-
Notifications
You must be signed in to change notification settings - Fork 298
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
li2_nc reader daskified #2985
li2_nc reader daskified #2985
Conversation
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #2985 +/- ##
==========================================
- Coverage 96.10% 96.08% -0.02%
==========================================
Files 377 377
Lines 55147 55155 +8
==========================================
Hits 52997 52997
- Misses 2150 2158 +8
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚨 Try these New Features:
|
Pull Request Test Coverage Report for Build 11934839781Warning: This coverage report may be inaccurate.This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.
Details
💛 - Coveralls |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, just a comment or two inline.
So this PR is checking if an array is not daskified and converts it to a dask array if it's not. I think it would be important to know why the array was not a dask array in the first place, otherwise we are still possibly loading the data into memory maybe unnecessarily. Tracking back the satpy/satpy/readers/li_base_nc.py Line 409 in fa56be5
could you please check if they are not daskified here already, and if so, why? And otherwise, if they're always daskified coming from the file, we should find out where in the code they are computed and turned into numpy... |
@ameraner here is the result of my investigation. The code that you showed me bellow satpy/satpy/readers/li_base_nc.py Line 409 in fa56be5
returns either non daskified or daskified xarray DataArray. The reason of the "daskification" or not is to be found there According to what I have seen all the non daskified array are in fact cached_data which is caused due to the the length of the array* the lenght of dtype_size is inferior to Then some non daskified array are transformed into daskify in this part of the code it is depending of the transformation made. |
Hi @ClementLaplace , thanks for the analysis! I see, it's quite as @gerritholl was suspecting :) Ok, then I would say it is ok to keep this implementation, in order to make sure that we have homogenised dask array outputs for dataset. |
This pull request enables the daskification of the non accumulated and non transformed dataset.